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but not on the length of the series n. Under appropriate assumptions, it is 
shown that a Gaussian quasi-maximum likelihood estimator is almost surely 
consistent and asymptotically normal. The theoretical results are illustrated by 
means of two examples of bivariate processes. It is shown that the assumptions 
underlying the theoretical results apply. In the second example the innova¬ 
tions are also marginally heteroscedastic with a correlation ranging from —0.8 
to 0.8. In the two examples, the asymptotic information matrix is obtained in 
the Gaussian case. Finally, the finite-sample behaviour is checked via a Monte 
Garlo simulation study for n going from 25 to 400. The results confirm the 
validity of the asymptotic properties even for short series and reveal that the 
asymptotic information matrix deduced from the theory is correct. 

Key words and phrases : Non-stationary process; multivariate time series; time- 
varying models. 

Running title: Asymptotics of QMLEs for tdVARMA models 

*Universite Moulay Ismail, Faculte des Sciences juridiques, economiques et sociales, B.P. 3102 
Toulal, Meknes, Morocco. This work was started while at Universite libre de Bruxelles, Faculty 
of Sciences, Dept of Mathematics, Blv du Triomphe CP 210, B-1050 Brussels, Belgium (e-mail: 
abdelkamel. alj @gmail.com). 

lUniversite libre de Bruxelles, Faculty of Sciences, Dept of Mathematics, Blv du Triomphe CP 
210, B-1050 Brussels, Belgium (e-mail: chrisley@ulb.ac.be). 

lUniversite libre de Bruxelles, SBS-EM, ECARES, avenue Eranklin Roosevelt 50 CP 114/04, 
B-1050 Brussels, Belgium (e-mail: gmelard@ulb.ac.be). 


1 



1 Introduction 


A large part of the literature on time series models is concerned with stationary mod¬ 
els. This is of course due to the ensuing mathematical simplihcations of stationarity. 
Even in that simple context, an asymptotic analysis is not necessarily easy; indeed, 
the celebrated autoregressive-moving average (ARMA) models popularized by Box 
and Jenkins (Box et al., 2008) require several pages in Brockwell & Davis (1991, pp. 
375-396) for the derivation of their asymptotic properties. However, the assumption 
of invariance over time (especially for long time intervals) is difficult to justify in 
most practical situations. Therefore, recent years have seen an increasing interest 
in models with time-dependent coefficients and non-stationary time series. Initiated 
by the seminal work Quenouille (1957), models with time-dependent or time-varying 
coefficients for univariate time series have been investigated over the years by, inter 
alia, Whittle (1965), Subba Rao (1970), Tjpstheim (1984), Kwoun & Yajima (1986), 
Singh & Peiris (1987), Priestley (1988), Grillenzoni (1990), Dahlhaus (1996a, b, c, 
1997), Bibi & Francq (2003), Azrak & MGard (2006) and Triantafyllopoulos & Nason 
(2007). We refer to the introduction of Azrak & Melard (2006) or of Van Bellegem & 
Dahlhaus (2006) for further references. In several of these papers, the coefficients of 
the ARMA models are not constant but are deterministic functions of time. Also the 
innovation variance can be a deterministic fnnction of time instead of being constant, 
snch as in Van Bellegem & von Sachs (2004). We can speak of marginal heteroscedas- 
ticity by opposition to conditional heteroscedasticity which is enconntered in ARCH 
and GARCH models. All these functions of time are snpposed to depend on a small 
number of parameters. Other somewhat related recent approaches include generalized 
antoregressive score (GAS) models of Creal et al. (2013) and testing parameter con¬ 
stancy against deterministically time-varying parameters, e.g. Terasvirta et al (2010, 
Section 6.3) and references therein, generalized to VAR models in Terasvirta & Yang 
(2014). 

The present paper inscribes itself in this line of research but for multivariate time 
series. The generality of the models we consider evidently entails numerous challenges, 
since the convenient asymptotic theory of stationary ergodic processes does no longer 
apply. Also, the asymptotic theory of time series models makes a large use of Fourier 
transforms and, conseqnently, of what is called spectral analysis. As a consequence, 
deriving conditions for consistency and asymptotic normality of estimators of the 
coefficients, as well as obtaining the asymptotic covariance matrix, becomes highly 
complicated. 
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We consider multidimensional time series models, with particular emphasis on vec¬ 
tor ARMA (VARMA) models, the multivariate extension of the ARMA models. The 
main difference between ARMA and VARMA models lies in the fact that the coeffi¬ 
cients change from scalars to squared matrices. The main developments in the area of 
statistical inference of standard stationary VARMA models are due to Kohn (1978), 
Hannan & Deistler (1988), Francq & Rai'ssi (2007) and, quite recently, Boubacar 
Mainassara & Francq (2011) who study the consistency and asymptotic normality of 
quasi-maximum likelihood estimators for weak VARMA models. However, the field 
of time-dependent VARMA (tdVARMA) models with marginally heteroscedastic in¬ 
novation covariance matrix remains largely unexplored. An exception is Dahlhaus 
(2000) using an entirely different approach and assuming that the coefficients depend 
on time t but also on the length of the series n trough their ratio t/n. Here we assume 
dependency on t only. Even if our theory is illustrated on pure VAR examples, it 
should be emphasized that it is valid for VMA and VARMA models, like in Dahlhaus 
(2000). Note that Liitkepohl (2005, Chap. 14) treats tdVAR models by Gaussian 
maximum likelihood but does not discuss asymptotic properties in the general case. 

Thus we want to fill in this gap in the literature by extending to the multivariate 
setting the methodology of Azrak & Melard (2006) who, to the best of the authors’ 
knowledge, were the first to obtain asymptotic properties of estimators for the gen¬ 
eral class of univariate time-dependent ARMA models by having recourse to quasi¬ 
maximum likelihood estimation (QMLE). 

Like other QMLE approaches, the estimation method in Azrak & Melard (2006) 
does not use the true, unknown, density of the observations but rather acts as if that 
density were Gaussian, thus using the Gaussian log-likelihood, which is an extension 
of the generalized least-squares method since it takes care of possible heteroscedastic- 
ity. There is no assumption of stationarity but, although it is not illustrated in our 
examples, there is an adjustment in the asymptotic theory for allowing non-normal 
observations. One major advantage of QMLE is that the Gaussian likelihood function 
can be computed exactly, with an efficient algorithm, Alj et ah (2015c), and this is 
very important for short time series. The main task in the Azrak-Melard approach, 
hence also in our extension, consists in checking conditions from two crucial theorems 
in Klimko & Nelson (1978), which respectively ensure existence of an almost surely 
(a.s.) consistent estimator and prove asymptotic normality of that estimator, whilst 
providing the asymptotic covariance matrix. This is precisely what we are aiming at 
but, as we shall see in the rest of this paper, it is all but an easy task. 
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Let us briefly comment on two other univariate approaches, and explain why we 
have not opted for extending those. 

- The Dahlhaus approach (Dahlhaus 1996a, b, c, 1997) has allowed to obtain 
asymptotic results for a class of locally stationary processes including het- 
eroscedastic ARMA processes with time-dependent coefficients. Dahlhaus uses 
either a spectral-based or a maximum likelihood estimation method. His asymp¬ 
totics are based on rescaling time, i.e. t/n; therefore this is not related to our 
approach where there is no such requirement. Moreover local stationarity im¬ 
plies that the coefficients are continuous functions of time (and even two time 
differentiable functions), which is not necessarily the case here. Azrak & Melard 
(2011) show a univariate example where Azrak & Melard (2006)’s theory holds 
but the assumptions of local stationarity are not valid. Note also that Dahlhaus 
(2000)’s theory for multivariate processes assumes a Gaussian process whereas 
we assume only existence of 8th-order moments. 

- The Bibi-Francq approach (Bibi & Francq 2003) applies the quasi-least squares 
estimation method and gives asymptotic results for cyclical ARMA models with 
non constant periods. Although only 4th-order moments are needed, the theory 
was not developed for multivariate processes until now. 

Another related approach, see for instance Tiao & Grupe (1980) or Basawa & Lund 
(2001), consists in ARMA models with coefficients that vary as periodic functions of 
time, see also Hindrayanto et al. (2010). If the period s is an integer, s consecutive 
variables can be stacked as a vector which satishes a stationary VARMA model. Here 
we do not assume periodic coefficients although, to simplify the derivations, our two 
examples will have periodic coefficients or innovation covariance matrix, but with large 
or irrational periods. Therefore stacking the variables will be practically inoperative 
and standard asymptotic theory for stationary VARMA models will not apply. 

Most of the technical lemmas used in the present paper and their proofs are given 
in a technical appendix denoted here ’TA’, see Alj et al. (2015a). The second part of 
this paper (Alj et ah, 2015b) will deal with the more general case where the coefficients 
of the model depend on time t but also possibly on the number of observations n of 
the series. However, even in that case, the theory differs from Dahlhaus’ approach 
in the sense that the coefficients do not need to be continuous functions of time. In 
the univariate case, an example is provided by Azrak & Melard (2011). The technical 
appendix, Alj et al. (2015a), will be shared by Alj et al. (2015b). The reason 
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to separate the material in two parts is that while theorems related to martingale 
sequences are enough in the context of the present paper, the second part requires 
martingale arrays, although the technicalities are more or less the same. 

Another aspect of the present paper is that it provides an alternative theory for 
the asymptotics of standard VARMA models that does not rely on stationarity or 
ergodicity arguments, although in the standard case, the assumptions will imply the 
usual conditions on the roots of the autoregressive and moving average polynomials 
in the lag operator. Our alternative theory also avoids spectral analysis. 

The paper is organized as follows. In Section |2l we hrst develop asymptotics 
for quasi-maximum likelihood estimators in a general multivariate time series model 
which is not necessarily stationary. Then, in Section [3l we focus our attention on 
tdVARMA models: after setting the notations, we analyze pure VAR and pure VMA 
representations, with an illustration, before hnally stating the main theorem for the 
tdVARMA case. We illustrate our theoretical hndings by means of two examples 
in Section 01 and examine the hnite-sample behavior of our estimators via a Monte 
Carlo simulation study in Section [5l Finally, Appendix A collects the main proofs 
and Appendix B contains a verihcation of the main assumptions for the two examples 
studied with a few nice mathematical derivations. 


2 QMLE for a general multivariate time series model 


2.1 Some preliminaries 

Let {xt : f G N} be a stochastic process dehned on a probability space {Q,F,Pg), 
taking values in R'", and whose distribution depends on a vector 9 = (6*i, ...,0^)^ of 
unknown parameters to be estimated, with 6 lying in some open set 0 of a Euclidean 
space R”^. Let Eg{.) and Eg{./.) denote expectation and conditional expectation under 
Eg, respectively. The true value of 9 is denoted by 9^ = (6*°,..., 9^)'^, assumed to be an 
interior point of 0. Let {Ft : f G N} be an increasing sequence of sub-sigma algebras 
of F with Ft generated by {xu ■ u = 1, 2,..., f} with Fq = {0, f2} such that, for each t, 
Xt is measurable with respect to Ft. Given a set of observations [xt : t = 1,2, ...,n}, 
we want to estimate 9 by trying to minimize the general real-valued objective function 
Qni.9) = Qn{9;xi,Xn) which depends on 9 and the observations {xt : t = 1,2, 
Therefore we solve the system of equations 


dQnjO) 

d9t 


0 for 


i = 1, 
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We suppose that the objective function Qn(0) is twice continuously differentiable 
in 0. Let 6n = {6i, be a sequence of estimators indexed by n. Klimko & 

Nelson (1978) showed conditions for strong consistency and asymptotic normality of 
On-, see also Hall & Heyde (1980, pp. 174-176) and Taniguchi & Kakizawa (2000, pp. 
97-98). 

2.2 General theory of quasi-maximum likelihood estimation 

Denote 

etiO) = xt- xt/t-i{0) with Xt/t-i{0) = Ee{xt/Ft-i), (2.1) 

for which obviously Ee^etiO)) = 0. We denote by 

T.t{e) = Ee[et{e)e^{e)lEt-i] 

the conditional covariance matrix given Et_i. The quasi-likelihood function L„(0; xi, ...,Xn) 
computed as if the process were Gaussian is given by 

n I' ^ n 

= (2,r)-"’'/2]qdet(E,(9))-'/"exp( --5^er(9)E,-‘(9)e,(9) 

t=i L ^ t=i 

We take the objective function 

1 ” 

Qn(0) = - log (Ln(0; Xi,..., Xn)) = (2.2) 

^ t=i 

with 

aM = log(det(E,m + e[miH0)et(0). 

Then the QMLE of 0 is dehned as any measurable solution of 

argminggQ Qn(&)- (2.3) 

In order to check the assumptions of the Klimko & Nelson (1978) theorems, we 
proceed like Azrak & Melard (2006) and we make some additional assumptions as 
follows. Let the r-vector stochastic process {xt : t G N} be such that Eq (||a;i||^) < cx) 
for all 9 and et{d) and 2^(0) are almost surely twice continuously differentiable in 0. 
Henceforth, for simplicity, we denote [Eo{.{9)}]g^QO by E0o{.{9)}. We suppose that 
there exist two positive constants Ci and C 2 such that for alH > 1: 
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where V = is a strictly positive definite matrix of constants; 


H24 



for i,j = 1 , where 6* is a point of the straight line joining 6^ to every 6, 

snch that || 6 * — 0*^11 <A, 0<A, where ||.|| is the Enclidean norm. 


The main notations and assnmptions being settled, we are ready to state the two 
main theorems of the present paper. 


Theorem 2.1 Suppose that Assumptions H 2.1 — H 2.4 hold. Then there exists a se¬ 
quence of estimators On = ( 6 * 1 ,..., 6 *^)^ such that On 0^ a.s. and, for any e > 0 , 
there exists an event E with Pqo^E) > 1 — e and an Uq such that on E, for any n > uq, 
{dQn{0)/d0i]Q^Q^ = 0 ,/or i = 1 , 2 ,..., m, andQn{0) attains a relative minimum atOn- 

The proof of Theorem 12.11 is given in Appendix lAl 


Theorem 2.2 If the assumptions H 2.1 — H 2.4 are satisfied, as well as H 2 .E 


n 




t=i 


dat{0) dat{0) 

dO dO'^ 


/ Ft_i j- 'y^EgO 


t=i 


dat{0) dat{0) 

dO dO'^ 


a.s. 

^ U as n ^ cx), 


then 


n 


F\0n - 0^) 4 A/(o, v-^wv-^), 


where —> indicates convergence in law and W = is a strictly positive 
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definite matrix defined by 



For the proof of Theorem 12.21 we use the Central Limit Theorem for martingale dif¬ 


ferences of Basawa & Prakasa Rao (1980). 

3 VARMA models with time-dependent coefficients 

3.1 tdVARMA models: definition and notations 

The process {xt : f G N} is called a zero mean r-vector mixed autoregressive-moving 
average process of order (p, q) with time-dependent coefficients, and is denoted by 
tdVARMA (p, q), if and only if it satisfies the equation 



(3.1) 


where p and q are integer constants, {cj : t G N} is an independent white noise 
process, consisting of independent random variables, not necessarily identically dis¬ 
tributed, with zero mean, covariance matrix S which is invertible, and finite fourth- 
order moments, and where the coefficients An, ...,Atp and Bn, ...,Btq, as well as the 
r X r matrix gt, are deterministic functions of time t. The initial values Xt,t < 1, 
and et, t < 1, are supposed to be equal to the zero vector. In the sequel, we will 
also use = Bto = with the r-dimensional identity matrix and set to zero the 
coefficients At^ with k > p and Btk with k > q, for all t. Writing 0 the Kronecker 
product, we let 


Kt = E (vecietej) vec{etejf) = E {{etej) 0 (etej)) , 


which depends on t, in general. For /c, / G N and k ^ I, we consider the matrix 


E (yec{et-kej_i) vec{et-ie, 


T 

t—k 


f) = a:,,,(S0S), 


which does not depend on t,k oil and where the x matrix is the commutation 
matrix. See Kollo & von Rosen (2005, p. 79) or TA Lemma 4.2. 





Let us now consider the parametric model corresponding to fl3.1l) . namely 


p 


Xt — '^^Ati{6)xt-i + et(0) + Btj{6)et-j{0), 


(3.2) 


2 = 1 


where the et{9) can be considered as the residuals of the model and are defined as in 
fl2.ip and where Ati{6), for i = 1, and for j = 1,g, are the parametric 

coefficients. Furthermore the covariance matrix of et{9) is parametrized as 


S,(0) {et{9)e^{9)) = {9). 


For 9 = 9^, we have Au{9°) = Au, Btj{9°) = Btj, gt{9'^) = gt, et{9°) = gtet and 
Et = E [et{9^)ef (9^)) = gt^gf ■ We assume that the m-dimensional 


vector 9 contains all the parameters of interest to be estimated, those in the coefficients 
Ati{9),..., Atp{9), Bti{9),Btq{9) and gt{9) but not the nuisance parameters in the 
scale factor matrix S which are estimated separately. In usual VARMA(p, q) models, 
the coefficients Ai{9 ),..., Ap{9), Bi{9 ),..., Bg{9) and gt{9) do not depend on t, and the 
parameters are the coefficients themselves. Note that for a given 9 we have 


Ee{xt/E^_^) = J2M0)xt-i + J2Btj{9)et_,{9). 


xt/t-i{9) 


According to the assumptions made about initial values, it is possible to write out 
properly the pure autoregressive and the pure moving average representation of the 
model fl3.2p . as we shall see in the next section. 

3.2 The pure autoregressive and the pure moving average 
representations 

By using the assumption about initial values and using f|3.2p recurrently, Melard 
(1985) and Azrak & Melard (2015) have established expressions for the pure autore¬ 
gressive representation and the pure moving average representation of tdVARMA pro¬ 
cesses. In the univariate case these representations can be found in Azrak & Melard 
(2006). In our setting, for any 9 the pure autoregressive representation corresponds 
to 


t-i 



(3.3) 


k=l 
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where the coefficients TTikid) can be obtained from the autoregressive and moving 
average coefficients by using the following recurrences (see Melard, 1985, pp. 43-45): 


7Tlf{e) = Atj{e), = for j = 

for j = k,...,t-l, 

- ^lt~^\0)Bt-k,j-k{0), for j = k + l, f - 1, 

and TTtkiO) = 7r|^^(6*) for k = — 1. By fl3.3p we of course have et{6) = Xt — 

'Yl^k2iT^tk{0)xt-k^ and consequently its hrst three derivatives with respect to 6 are 
given by 

det{e) 

de, 

d^et{6) 

dOidOj 

d'^etjO) 
dOidOjdOi 

for i, j, I = 1, m. 

On the other hand, for the pure moving average representation we have 

t-i 

Xt = et{e) + '^'i/Jtk{0)et-k{0), ( 3 . 5 ) 

k=l 

where the coefficients ipikiG) = '^ik\^)^ = 0,1, ...,t — 1, can be obtained from the 

autoregressive and moving average coefficients by using the following recurrences (see 
Melard, 1985, pp. 36-38): 

4^(6) = fo, 4f{e) = Bt,{e), 4f{e) = At,{e), for j = i,...,f-i, 

4j\^) = +4k~^\^)Bt-k,j-k{0), for j = k,...,t-l, 

4j\^) = +4k~^\^)^t-k,j-k{0), for j = k + l,...,t-l, 

for each fc = l,...,f — 1. Hence for 6 = 6^ we have 

t-i 

Xt 9t^t 4“ ^ ^ '4^tk9t—k^t—ki 
k=l 


t-1 


^ dTTtk (0) 

/ j c\n Xt—ki 


k=l 

t-1 


de. 


/ ^ ci/1 Q/i Xt—ki 


k=l 

t-1 


89,89 


(3.4) 




^ 8^7itk{9) 

/ ^ 0/1 Ci/i Cl/1 ^t—kj 


k=l 


89i89j89i 
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where we denote V’tfc = Then, by using flS.Sp . et{0) and its hrst three deriva¬ 

tives in (13.4p can be written as a pure moving average in terms of the innovations 
process: 

t-i 


— gtCt + 'f’tokjd, 6^)gt-k^t-ki 

k=l 

(3.6) 

J ^ = ^iJtik{0,0^)gt-ket-k, 

* k=l 

(3.7) 

QO.QQ. = ^^tijk{0,e'^)gt-ket-k, 

* ^ k=i 

(3.8) 

d^et{9) 

dOMidOi ~ )9t-ket-k. 

* ‘ k=i 

(3.9) 


for z, j, I = 1,m, where the coefficients iptokiO, 6'°), iJukiO, 6^),i’ujk{0, 0°) and t/JujikiO, 6^) 
are obtained from the autoregressive and moving average coefficients by the following 
relations: 


k 


fJtok{0,0^) 



M9,9°) 

d'^tu{9) 

/ . Yt—u,k—u-) 

(3.10) 

i’tijk{9,9^) 

•s^ d‘^TTtu{9) ^ 

~ ^ 89,89, '^t-u,k-u, 

U=l * -7 

(3.11) 

'iftijik{9,9^) 

8^T^tui9) 

^ 89,89.89i'"'^~'"' 

U=1 

(3.12) 


We denote 

0°), iJtik = 'iptik{d^, 0^), ‘iptijk = ‘iptijkiO^, 0^) and 'ijjujik = i’ujikid^, 0°). 

Remark 3.1 If the process were not started at time t = 1, it should be necessary to 
impose a causality and an invertibility condition, see for example Hallin & Ingenbleek 
(1983) and Hallin (1986). Note that 'iptok{0^■, = 0, for k>l, and 1, for fc = 0. 
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3.3 An illustration: tdVARMA(l, 1) 

Let {xt : t G N} be an r-vector time series satisfying 


Xt — At{6)xt-i + Gt{6) + Bt{6)et-i{6). 


It is easy to see that the tdVARMA(l,l) model is considered as a special case of the 
model dehned in (13.51) with p = 1 and q = 1- Now following Melard (1985) and Azrak 
& Melard (2015), in this special case, the coefficients of the pure moving average 
representation are given by: 


+ A.k+my for k = l,2,...,f- 1, 

^ 1=0 


where a product for / = 0 to —1 is set to A- The coefficients of the pure autoregressive 
form are 

k-2 

'Ktkie) = + 

1=0 

so, for i = 1, their derivatives are given by 

dMO) dA{9) dBt{9) 

d9i d9i d9i 


dnt2{9) 

d9i 


^^{A,_,{9) + B,_,{9)} - B,{9) 


dA.,i9) dBt.ii9)\ 

09, 09, J 


O'XtoiO) 

09, 


QBt{9) 

09, 


Bt_^{9){A_2{9) + Bt_2{9)] 


+ Bt-2{9)] 


09, 

B^(9)B^_^{9) 


0At_2{9) ^ 0Bt-2{9) 


09i 


09, 


Consequently 


where 


0ntk{9) 

09, 




—h,k,l,h 




l=l \h=l 


1 




dxt,k,l,hiA 

ddi 

Xt,k,l,hi^) 


if h = l, 
if h^l, 
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and 


Bt{9) if h < k 

At{e) + Bt{e) if h = k. 


Xt,k,l,hi9) 


Then 


k ( u / u 


Aik{e,e^) = EiE n»+i —h,k,l,h,i{9') I ^ A Xt+l—h,k,h{9 


u=l (=1 \h=l 


. h=u+l 


Xt+l—h,k,h{9') 


At{9) if h<k, 

At{9) + Bti9) if h = k. 


In the univariate case these results can be found in Azrak & Melard (2015, Chapter 
4), correcting Azrak & Melard (2006). 


3.4 QMLE of tdVARMA(]3, q) models: asymptotic results 

Let {xt : t = l,2,...,n} be a partial realization of length n of the process {xt : 
f G N} defined in (13.Ih . In the present section, we shall apply the general results of 
Sectionto the tdVARMA(p, q) setting after formulating the minimal requirements 
for satisfying Assumptions H 2.1 —ii 2 A (resp., H 2 ,i —H 2 , 5 ). The notations Qn{9), at{9) 
as well as the QMLE solution fl2.3p remain of course the same here. 

Theorem 13.11 below establishes the strong consistency of the QMLE and further 
the asymptotic normality of this estimator. For convenience, we suppose that the pa¬ 
rameters in Ati{9) for i = 1, ...,p, in Btj{9) for j = 1,..., g, and in gt{9) are functionally 
independent. Without loss of generality we suppose that the vector 9 is composed 
of three sub-vectors A, B and g, more concretely 9 = {A^,B^,g^)^, A being the 
sub-vector of the parameters included in Ati{9) for i = 1, with dimension si, B 
the sub-vector of the parameters included in Btj{9) for j = 1,..., g, with dimension S 2 
and g the sub-vector of the parameters included in gt{9) with dimension m — si — S 2 - 
Let us dehne the following Schur or Frobenius matrix norm. 

Definition 3.1 The Schur or Frobenius norm is a matrix norm of an m x n matrix 
A defined as 

\\A\\^ = Vtr(ATA). 

For further information about this matrix norm, see Golub & Van Loan (1996, p. 55). 

We now introduce a set of assumptions that will allow us, as mentioned above, to 
use results from Section [221 We assume for alH G N; 
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Hs.i ; The matrices Ati{9), Btj{6) and gt{0) are three times continuously differentiable 
with respect to 9, in an open set 0 which contains the true value 9^ of 9. 

H 3.2 ; There exist positive constants iVi, N 21 N^, iV 4 , and 0 < < 1 such that, 

for z/ = 1 ,t — 1 , 


t-i 


t-i 


k=v k=v 

t-1 t-l 

k=v 

t-l 

^WiJujikfp < ^ 5 , hi,I = 


k=v 


k=l 


H 3.3 ; There exist positive constants K 2 , K 3 , K 4 , such that 


09 , J 


<K,, 


e=eo\ 

d^T\ 0 ) \ 

09i ) 0 =e 


0^T.t{9) \ 

09,09j i 0 = 0 o 


<K 2 , 


1 09,09,9i J 


9=90 


<Ks, 



2 

< Ka, 

lO^Yi\9)\ 

90 

F 

\ 09,09j 


< it's, = 


H 3.4 ; There exist positive constants Mi, M 2 , and M 3 such that 


E[{et^etY] <Mi, \\E 


T ^ gT 


t J IIf 


< M 2 , 


II^zIIf + lr 6 c(S). vec(S)^||^ + ||S 0 Slip, + ||-h'r,r(S 0 S)||p, < M 3 . 
H 3.5 ; There exist positive constants mi and m 2 such that 


\\9tfF<mi, ||S/||^<m 2 . 


Furthermore, we suppose that 


H 


3.6 


lim — 

n—>-co n 


n . 

t=i f 


( OeJ (9) 
V 09, 




(^) 


0et{9) 


09, 


1 


^_,OYi{9)^_,OYi{9) 


09i 


09i 


9=90 


= v,„ 
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for i, j = 1,m, where the matrix V = is a strictly positive dehnite 

matrix; 

^ n—1 n—d t—1 

IIS't-fcIliT’ IIV’tifcllF IIV’t+(i,i,fc+d|| 

d=l t=l k=l 



n—1 n—d 


n 


d=l t=i 


"t-i 


^tokk^t-k^tdkk 

.k=l 


t-1 t-1 

ki=lk 2 =l 


+ EEAah,(S0S)A&* 

ki = l k2=l 



with 

Ht(E) = Kt(E) — vec(E). vec(E)^ — (E $$ E) — ® E), 

and, for k', k" = k, ki, /c 2 , 


Kfk'k" = vec(^i^fc,V'i+/,i,fc'+/E4+j^/’t+/,yfc»+/c/i-fc»), / = 0, d. 


Remark 3.2 These assumptions are a generalization of those in Azrak & Melard 
(2006). Note however that their assumption about upper bounds of the 4th order 
moment of the process is not needed in their proof. It will also not be used here, hence 
it is left out. 

With these assumptions in hand, we are able to show that the conditions for 
Theorems 12 . If ^I72\ hold (see Appendix A.l for a sketch of the proof) and hence obtain 
the following result. 

Theorem 3.1 Suppose that Assumptions H 3 . 1 -H 3 7 hold. Then there exists a se¬ 
quence of estimators On = ( 6 * 1 ,..., 6 m)^ such that 

• On ^ 0^ a.s., and for every e > 0 there exists an event E with Pqo{E) > 1 — e 
and an uq such that, for n > uq on E, Qn{0) reaches a relative minimum at the 
point On,' 
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- 0°) A AA(0, V-^WV-^), with 


T;r^ 1 - 1 17 (dat{9) dat{9) 


t=l 


Remark 3.3 For the sake of simplicity in the proof of Theorem \3.1\ (see Appendix 
A.3), we have made assumptions on in addition to those on g^. The proof is 
somewhat similar to that of Azrak & Melard (2006) but is extended to multivariate 
processes. Note however several corrections with respect to that paper (e.g. the treat¬ 
ment of the third term of flA.4p ; 9^ and 9 were sometimes not distinguished where they 
should, especially in Section and improvements (the treatment of the last three 
terms of flA.4D is more detailed; also the role of the assumptions is better enlightened). 


4 Some examples 

The two examples will show that the theory can be applied and that the assnmptions 
can be verihed. 

4.1 Example 1: tdVAR(l) a generalization of Kwoun &: Ya- 
jima (1986) 

In the following example we discnss a generalization of Kwonn & Yajima (1986). To 
achieve this we consider the bivariate tdVAR(l) model 



Af Af 


Xt-1,1 

Xt-1,2 



(4.1) 


where the coefficients Al^{9) for i,j = 1,2 are dehned as 

A't’W = A;sm(a„t+ A",). 


(4.2) 


The nnknown parameters AS and A'k are snch that AS G [5,1 — 5] and AS g [0, 27r —5] 
for some hxed 1/2 > 5 > 0 and are known constants. Then 


f) — (A' A' A' A' A” A" A” A" V 

^ — 1 ^ 11 ) ^2li ^ 12 ) ^ 22 ) ^ 11 ) ^2li ^ 12 ) ^22) ■ 
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The numerical example proposed by Kwoun & Yajima (1986) for r = 1 contains 
a process with periodic coefficients of period 4 because aij = ti/2. However, it is 
well known, see e.g. Tiao & Grupe (1980) and Azrak & Melard (2006), that an r- 
dimensional autoregressive process with periodic coefficients of period s G N can 
be embedded into an s-dimensional stationary autoregressive process. To avoid this 
simplification we consider coefficients (6) either with distinct irrational periods or 
at least with large relatively prime periods. We check the assumptions of Theorem 
13.11 in Appendix IB. II in the simplified case where we have 


A{9) 


sin (at) 
0 


1 

2 

^22 sin( 6 f) 




( 4 . 3 ) 


with 9 = A' 22 )'^. For the simulation study, we shall rather have recourse to the 

model 


with 


MO) 


A'^i sin(at) A\ 


12 


0 


A '22 sm{ht) 


9 — (2I11, Aj_ 2 , A22) , 


27r 

V2499 


and b 


271 

V2399' 


(4.4) 


(4.5) 


We impose that G [0,1[, partly similarly to Kwoun & Yajima (1986), and 

write 0 ° = (A;o,0.5, ^ 22 )^ for the true value of 9. For simplicity, we take here T, = I 2 . 


4.2 Example 2: tdVAR(l) with heteroscedasticity 

Let us re-consider the model defined in fl4.ip - fl4.3l) . except that 4^2 is no longer a 
parameter, with the added difficulty that the innovations are now gtet instead of e*. 
Therefore we introduce a matrix gt{9) and we have a bounded time-dependent covari¬ 
ance matrix T,t{9) = gt{9)T,gf (9). We have extended the Kwoun & Yajima (1986) 
parametrization by taking 

f exp (-7711 sin(cf)) 1 

I -1 exp (-7722 sin(cf)) 
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with c G M. Also we use a matrix S which is no longer the identity matrix: 


E := ( ] . 

\ Si 2 S22 J 

A close examination of St(0) shows that if gt{0) were diagonal, then the correlation be¬ 
tween the residuals would be constant, which is not very realistic for a time-dependent 
process. This is why we have put off-diagonal elements different from 0 in (14.bh . Here, 
the vector 9 reduces to 


^ — (^ 11 )^ 22 )^ 115 ^ 22 )^- 

The assumptions of Theorem 13.II are checked in Appendix IB.2| and simulation results 
shown in Section l5^ 

5 Simulation results 

5.1 Example 1: tdVAR(l) a generalization of Kwoun &: Ya- 
jima (1986) 

The simulation experiment is performed in Matlab by using the program, which we 
call AJM, described in Alj et ah (2015c) and based on a special case of tdVAR(l) 
process dehned in fl4.4p - fl4.5l) . with = 0.8 and A '22 = —0.9 and ( 641 , 642 )^ has a 
bivariate normal distribution with covariance matrix T, = I 2 . A simulated series using 
these specihcations is shown in Fig. 1. 

The true value of 6 is 

0° = (A'°,A?2)^2°2)^ = (O.8,O.5,-O.9)^, 


We take 


= ( 0.1 0.1 0 . 1 )^. 


as initial value of 9. 

The experiment was replicated 1000 times. The results are summarized in Table 
[H As the sample size becomes larger, we can see that 

• the averages of the estimates become closer to their true value in accordance 
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Table 1: Estimation results for the model (HU under (14.411 - (14.51) via the program AMJ, where 
lines (a) give the averages of the parameter estimates, lines (b) give the averages across simulations 
of estimated standard errors of the corresponding estimates for the 1000 replications, lines (c) the 
sample standard deviations of the corresponding estimates for the 1000 replications and lines (d) 
give percentages of simulations where we reject the hypothesis E^t significance level 5%. 


Sample size 


A'n 

= 0.8 

^'l2 

<2 = 0.5 

hs 

II 

to 

1 

O 

Sii 

Ell = 1 

^12 

Si2 = 0 

S 22 

E 22 = 1 

25 

(a) 

0.7481 

0.5014 

-0.8036 

0.9804 

-0.0026 

1.0760 


(b) 

0.2023 

0.1543 

0.2118 

- 

- 

- 


(c) 

0.2161 

0.1654 

0.2226 

- 

- 

- 


(d) 

7.1 

7.7 

4.8 

- 

- 

- 

50 

(a) 

0.7714 

0.5035 

-0.8410 

0.9845 

0.0022 

1.0392 


(b) 

0.1397 

0.1049 

0.1355 

- 

- 

- 


(c) 

0.1344 

0.1112 

0.1525 

- 

- 

- 


(d) 

4.5 

6 

6.6 

- 

- 

- 

100 

(a) 

0.7855 

0.4975 

-0.8650 

0.9927 

0.0065 

1.0224 


(b) 

0.0963 

0.0735 

0.0926 

- 

- 

- 


(c) 

0.0995 

0.0733 

0.0964 

- 

- 

- 


(d) 

5.4 

4.8 

5 

- 

- 

- 

200 

(a) 

0.7905 

0.4984 

-0.8905 

0.9959 

0.0037 

1.0087 


(b) 

0.0677 

0.0510 

0.0628 

- 

- 

- 


(c) 

0.0713 

0.0513 

0.0635 

- 

- 

- 


(d) 

5.4 

5.7 

4.2 

- 

- 

- 

400 

(a) 

0.7976 

0.5000 

-0.8932 

0.9946 

-0.0001 

1.0073 


(b) 

0.0474 

0.0358 

0.0440 

- 

- 

- 


(c) 

0.0471 

0.0363 

0.0448 

- 

- 

- 


(d) 

5.2 

4.7 

5.1 

- 

- 

- 


with the theory, 

• the sample standard deviation on line (b) becomes also closer to the averages 
across simnlations of estimated standard errors in line (c) showing that the 
standard errors are well estimated, and 

• the percentage of rejecting the hypothesis Hq is close to 5%. 

We compare, for the sample size n = 400, a histogram of the 1000 replications of 9i 
to the normal probability cnrve with mean eqnal to 9^ and standard deviation given 
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Figure 1: Time plots of the coefficients and the simulated tdVAR(l) generated by the process 
defined in (14.411 of length n = 400. 


in line (b) of Tabled! As we can see from the corresponding Figure [21 this histogram 
shows empirically consistency and normality of the estimates. 


5.2 Example 2: tdVAR(l) with heteroscedasticity 

We keep the bivariate model dehned by fl4.111 - 114.3p . with the same numerical values 
for a and b but without with g^et instead of and a covariance matrix Tjt{0) = 

gt{9)J2gf (9) bounded but time-dependent, where 

exp (-7711 sin(ct)) 1 \ 

-1 exp (-7722 sin(cf)) y 

with Ell = S 22 = 1, Ei 2 = 0.5, c = 27r/25 and t/h = 1, 7722 = —1 so that the 
innovation correlation coefficient varies between —0.8 and 0.8. 

Again, the number of replications is 1000. The results are presented in Table [2l 
Moreover a program for computing the asymptotic information matrix on the basis of 
the formulae given in Appendix IB.21 gave, for 77 = 50 for example, the standard errors 
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A11 



Figure 2: Histograms with the normal density function of fOOO replications of the parameters 
with n = 400. 

Table 2: Estimation results for the model dSl) under (I4.4I) - (I4.5I) and (I5.1I) . via the program AMJ, 
where columns (a) give the averages of the parameter estimates and columns (d) give percentages of 
1000 simulations where we reject the hypothesis at significance level 5%. 


Sample size 

(^11 

= 0.8) 

A’22 {A 22 — 

-0.9) 

hii (h?i 

= 1.0) 

h22 (h22 = 

-1.0) 


(a) 

(d) 

(a) 

(d) 

(a) 

(d) 

(a) 

(d) 

25 

0.7671 

3.8 

-0.8567 

4.5 

0.9897 

3.2 

-0.9848 

5.1 

50 

0.7808 

4.2 

-0.8766 

5.0 

0.9913 

2.9 

-0.9964 

5.8 

100 

0.7910 

4.4 

-0.8864 

5.8 

0.9977 

4.1 

-0.9975 

6.7 

200 

0.7963 

5.3 

-0.8931 

5.1 

0.9997 

5.3 

-1.0000 

6.4 

400 

0.7972 

6.3 

-0.8970 

4.2 

0.9980 

4.7 

-0.9984 

5.6 


0.0905, 0.908, 0.1995, 0.1995, whereas the averages of the standard errors estimated 
by the QMLE program were 0.0963, 0.1217, 0.2027, 0.1516, respectively, and the 
standard deviations of the 1000 estimates were 0.0917, 0.1227, 0.1879, 0.1587. 
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Appendices 


A Proof of theorems 

Since we proceed as in Azrak & Melard (2006), we just sketch the proof, at least 
in Section lA.ll Lemmas preceeded by TA refer to the Technical Appendix, Alj et 
ah (2015a). 

A.l Proof of Theorem 2.1 and of Theorem 2.2 

As mentioned in Section 12.21 we check the following four assumptions of Klimko & 
Nelson (1978) for Qn{0) dehned by (12.2p . 

Assumption Hi 1 0, for i = 1,..., m; 

I ) e=eo 

Assumption Hi, 2 n~^ | ^ | ^ 14^, for i,j = l,...,m, where V = 

(V^,) is a strictly positive dehnite matrix of constants; 


lim sup(nA) 

n->-oo 


-1 


r d^Qn(S) ] 


V 


} e=e* I 




) e= 


6»o 


Assumption Hi 3 

for i, j = 1, ...,m, where 9* is a point of the straight line joining 9^ to 
||0-0°|1 < A, 0 < A. 


< 00 a.s., 

6*, such that 


Assumption Hi ,4 n ■A/'(0, IL), where W = 

K ) 0=0^ 

is a strictly positive dehnite matrix. 


Remark A.l Assumption Hi ,3 coincides with H 2 , 4 . 


A.1.1 Proof of Hi 1 

By TA Lemma 4.4, we have that {dat{9)/d9i, Ft} is a martingale difference sequence. 
Then we can use a strong law of large numbers for martingale sequences (Stout, 1974, 
p. 154) since H 2 ,i implies the condition for it, more precisely 


E 


EqO 


dat(9) 

dBi 


P 


tl+P/2 


< 00 , 


(A.l) 
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for p = A. 


□ 


A.1.2 Proof of Hi 2 

By TA Lemma 4.5, we have {d'^at{9)/dOidOj — at{9)/dOidOj], Ft} is a martingale 

difference sequence. Then we can again use the Stout (1974) strong law of large 
numbers, by adapting flA.lD . since H 2,2 implies the condition for it for p = 2. Hence 



Then one half of the a.s. limit of the hrst term is n ^d^Qn{9)/d9id9j for 9 = 9^ and 
dehnes Vij, by H 2 , 3 . □ 

Up to now, all the assumptions of Theorem 2.1 are verihed. To prove Theorem 
2 . 2 , there remains to check Hi, 4 . 

A.1.3 Proof of Hi 4 

We proceed by using the Central Limit Theorem for martingale difference sequences 
of Basawa & Prakasa Rao (1980). Using the Cramer-Rao device with any vector A, 
this requires to prove that, for ^t = ^^[dat{ 9 )/d 9 ]Q= 90 , is bounded. 

This is done again using H 2 ,i. Then, with W dehned by H 2 , 5 , asymptotic normality 
follows with the stated covariance matrix. □ 

A.2 Further preliminaries 

In order to prove Theorem 3.1, we need the following Lemma, due to Hamdoune (1995) 
(see also Azrak & Melard 2006), and a strong law of large numbers for mixingale 
sequences. 

Lemma A.l Let {wt,t = 1,...,?7,} be, for each n ^ N, a scalar process with finite 
second-order moments, i.e. 

Lemma [A.l\ -i: E{wl) < 00 

Lemma \Xl\ -ii: E Ym=i '^t) = 0{n~^) with <5 > 0. 

Then, n~^ converges a.s. to zero when n tends to infinity. 

We also need a strong law of large numbers for mixingale sequences, e.g. Hall & Heyde 
(1980, Theorem 2.21) in the special case where their sequence bn = n. Let us recall 
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the definition from Hall & Heyde (1980, Section 2.3). 


Definition A.l Let {w^,t > 1 } be square-integrable random variables on a probability 
space and {Ft,—oo < t < cxo} be an increasing sequence of a-fields of F. 

Then {w^, Ft} is a L 2 -mixingale sequence if for sequences of nonnegative constants 
and Ct where ^ as z/ —)■ oo, we have 

i. E{E{wt\Ft-ufi} < fiuCt, and 

ii. E{wt - E{wt\Ft+„))'^ < fju+ict ■ 

Lemma A .2 If {w^, Ft} is a L 2 -mixingale sequence, and if Ct < oo and fin = 
0 (n“^/^(log? 7 ,)“^) as n ^ oo, then n~^ Ylt=i as n ^ oo. 


A.3 Proof of Theorem 3.1 

First of all, as shown in Section 13.21 assumption H 3.1 is used to define the fi^s used 
in H 3.2 and in the derivatives in the other assumptions. The idea is to check the five 
assumptions of Theorems 2.1 and 2.2. Then H 2.1 and H 2.2 are direct consequences of 
TA Lemma 2.1 (using TA Lemma 4.11) and TA Lemma 2.2 (using TA Lemma 4.12), 
respectively. This makes use of assumptions H3 2 , H3 3, H3.4 and H3 5. 


A. 3.1 Proof of H2.3 

Let us consider the process defined by 


^tij - 




f dej (9) defie) 






By using TA Lemma 4.13 and assumptions H3.2, H3.4, Hs.s and H3.7, the two as¬ 
sumptions i and ii of Lemma A.l are fulfilled for hence 


1 

n 


E 



sr'(«) 


det{9) 

89, 



- Ego 





det{9) y 
89, J_ 


(A.2) 


when n —>■ cxo. However, from TA Lemma 4.5, we have for i,j = 1, ...,m: 


1 

— y ^00 

2n ^ 

t=i 



1 

n 



def {9) 
89, 




det{9) 

89, 


e=eo 
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1 

2n 




(A.3) 


0=00 


Then flA.2p implies that the a.s. limit of flA.Sp for n —)■ oo will be equal to 


n^oo ri 


dO, 


de, 




e=e° 


and this is Vij as dehned in H 


3 . 6 - 


A.S .2 Proof of H2.4 

By using TA Lemma 2.3, it suffices to show that 


lim — 

n—>00 n 


I d^at{6) 

^{dOidejdei 



< 00 


a.s. for i,j, I = 1 ,..., m. 


As a consequence of TA Lemma 2.4, the expression on the left-hand side can be 
bounded by 


$1 


n 


V $2* + - V ^n + - V + - V ^3t, 

^^ n ^^ n ^^ n ^^ 


(A.4) 


t=i 


t=i 


t=i 


t=i 


where $1 is shown in TA Lemma 4.14 (using assumptions H3.3 and 113 , 5 ) to be 
bounded, and the last four terms $ 2 *, T 20 and 1^34 are briefly described now: 

• $24 contains terms like ej{6) {9) / dOidO jdOi) e^{9)] 

• Tii contains terms like ef ( 0 ) / dOidOj) {de^{6)/dOi), ej{6) (9Ej“^(6*)/c}6*j) 

{de‘f{6)/d6jd6i), and ej{6) {de^{6)/d6id6jd6i), where i,j,l = l,...,m; 

• the last two terms ^ 2 t and contain respectively terms like {dej{6)/d6i) 

{d'E^^{6)/d6i) {de^{6)/d6j) and {d‘^ef{6)/d6id6j) {de^{6)/dOi), where 

iJJ = 1, ...,m. 
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For i,j, I = 1, let us define the following six sequences of random variables 


Af' = 


_ 


= 


W, 


(l)ijl 




de. 


89, 


89, 


— EqO 


e=eo 


8ef{9)8J:T\9)8eM 
89i 89. 




\ 89,89j 


89, 


— Ego 


e=eo 


j S=>E,-‘W p 


= (eU9) 


/ e=eo 

8^j:t\ 0) deM 

89,89j 89, 


8e^ {0)y-i(n, 


I 89j 

9eti9) 

89, 


We,(«) 


Ego 


e=eo 


ei (0) 


89,89j 89, 

d-‘Y.;\e)Be,(e) 


89,89j 89, 


and and defined similarly by replacing {8‘^T,~[^{9)/89,89j){8e^{9)/89,) 

with, respectively, {811^^{9)/89,){8el{9)/89j89i) and {9){8el{9)/89,89j89i). 

First, we have that E{Zt/Et-i) = 0 and, according to TA Lemma 4.15 (using 
assumptions H3.3, H3,4 and H3.5) , that E{Z^) is uniformly bounded by a constant. 
Then, the strong law of large numbers (Stout, 1974, p. 154) implies that 


n 

t=\ 


The arguments for the other sequences are more involved. From TA Lemmas 4.16, 
4.18 and 4.20 we have that E,}, q = 1,2,3, {Xf^E,} and {Y^^\Et} are 

L 2 -mixingale sequences. From TA Lemmas 4.17, 4.19 and 4.21 we have that these 
L 2 -mixingale sequences F^}, q = 1,2,3, F^} and {Y^^\Et} fulhl the 

conditions in Lemma A.2, the strong law of large numbers for a mixingale sequence 
(Hall & Heyde, 1980, p. 41, Theorem 2.21). This makes use of assumptions H 3 , 2 , 
H3.3, H3.4 and H3.5. Hence 


n 


-1 




a.s. ^ 

^ 0 , 


t=i 


n 


-1 




a.s. ^ 
^ 0 , 


t=l 


n-^ 0 , 

t=l 


n n 

n~^ ^ Xy 4 0 , and n~^ ^ Y^ 4 0 , 

t=i t=i 


and this for i,j, I = 1 ,..., m. 

Consequently, the proof is completed so that H 2.4 is checked. Then for every 
e > 0, there exists an event F with Pgo{E) > 1 — e and an uq such that, for n > Uq 
on F, Qn{9) reaches a relative minimum at the point 9n- Consequently, there exists 
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an estimator 9n such that ^ as n —)■ cx). 


A.3.3 Proof of H 2,5 


From TA Lemma 4.10 we can determine the explicit form of the left-hand side of H 2.5 
for all 1 < < m: 


4 

n 


E 



det{e) 

de. 



- Ego 



\'-W\ 

I J 6»=6»0 \ / . 



^i\0) 


do, )_ 


K, 


tj 



t=l 


. I J 6=00 V / . 


K, 


tA") 


(A.5) 


where Kt^i is dehned in TA Lemma 4.10. While checking H 2 , 3 , we have shown that 


4 

n 


E 


def {9) 

89, 


^i\0) 


deM 

89, 




deM V 
89, ) 


^ 0 . 


There remains to prove that the second and third terms of (1A.5I1 also tend a.s. to zero. 
To achieve that, let us consider 




\ def (9) \ / def{9) \ 

1 f e=eo \ / 




t.i 5 


for i,j = 1, Then, by TA Lemma 4.22 (involving assumptions H 3 , 4 , H 3.5 

and 113 , 7 ), the two assumptions i and ii of Lemma A.l are verihed, entailing that the 
last two terms of (lA.Op also tend to zero almost surely. 

As a conclusion, the asymptotic convergence of the estimator 9n towards the nor¬ 
mal distribution is ensured and the proof of Theorem 3.1 is achieved. □ 


B Assumptions checking 

We check the assumptions of Theorem 13.11 for the two examples of Section 01 hereby 
providing a theoretical foundation for the simulation results of Section [5l Since several 
of the assumptions are somewhat similar, we have only covered once each argument. 
Also, since Example 2 is a generalization of Example 1, we have avoided to repeat 
some of the verihcations when they are too similar. 
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B.l Example 14.11 

B.1.1 Assumption H 3 i 

Trivial 

B.l.2 Assumption H 3 2 

In order check this hypothesis, we shall have recourse to the results of Section 13.21 In 
this example the coefficients of the pure moving average representation of (14.Ih are 
given by 

k-l 

1=0 

for A; = 1, 2,f — 1. The coefficients of the pure autoregressive form are 


nti{e) = At{e), 


and 7itk{0) = 0 if A; = 2,f — 1. 

Then, by using fl3.10p - fl3.12p . we can calculate 'ipuk{(^,(^^), '<Ptijk{(^-,(^^) and 
'4’tijik{0,9^). For example, denoting = Y\dZi and its (z,j) element 

hi = 1,2, we have 


fe(9,9”) = %^hnA-,(9“) = 

* i=l 

9^t{9) ^(k-i) 

39, ‘ ’ 

(B.l) 

i^uAe.e)- llA,_i{9) - 

* ^ 1=1 

d9id9, * ’ 

(B.2) 

^ ^ ^ ) - 
* ^ ‘ Z=1 

3^ At{9) .(fc-i) 
39,39j 391 * 

(B.3) 

for k = l,2,...,f — 1, where A® = I 2 . Obviously, checking the assumptions in the 
general setup happens to be a complicated and tedious task, hence, for the sake 
of simplification, we shall consider the model defined in (|4.3p. It can be shown bv 
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induction that 


A 


(k-i) 


(k-i) (k-i) 


0 A 


(k-i) 

t,2,2 


(B.4) 


for k >2, where 


fc-i 


k-l 


= i^u)^ ^ JJsin(a(t - /)), ^ JJsin(6(t - /)), (B.5) 


1=1 


1=1 


k-l 


k-2 




(B.6) 

1=1 f=l 

and cif = a and 5;/ = 0, for / + / < fc — 1, and q/ = b and 5// = 1, for / + / > fc — 1. 
For example, for k = 4 

^ (^(^ii)^ sin(a(t — 1)) sin(a(t — 2)) + sin(a(t — 1)) sin(6(t — 3)) 

+ (^ 22 )^ sin(6(t — 2)) sin(6(t - 


For ^ = 1, i.e. 9i = and using fIB.dp we have 
Aik = 


sin (at) 0 

0 0 


4(^-1) 4(^-1) 

^t,l,l ^t,l,2 


0 


(fc_l) (fc-1) 

• ^ I ^t,l,2 

= sin(at) ' 


0 


0 


(B.7) 


where a dash will always indicate an element that will not be used. By the same way 
for 02 = A'o- 


^22 


A2k = 


0 


0 


We dehne the constant 


0 sin(6t) 


0 A 


(k-l) 

t,2,2 


= sin(bt) 


0 0 
0 dS’ 


(B.8) 


$1/2 _ 


max 


a'° 

^11 


A 


(B.9) 


which is such that 0 < <F < 1. Now by using (1B.5[1 - (]B.9[1 and bounding sines by 1, we 
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can show that 




(B.IO) 


<{k- l) 4 t 


and 


-4|S' < 


1 


t2k\\i = [Mbt)Ai^2:2'’]' ^ 

k=i' k=u 


(B.ll) 


(B.12) 


Consequently, from fIB.SD and flB.12D . 

Elk', 

k=i' k=u k=u 

where Ni = 1/(1 — d)). By the same way, from flB.7p and flB.lOp - OB.lip . 

'^UtikWl = 5^sin2(af) +(^£ 72 ^^) (B.13) 

k=u k=iy *- -* 

t-l t-l 

k=u k=u 

but this cannot be bounded by for some constant iVi, independently of v, so 

a more subtle upper bound needs to be found. Using fIB.bp . an upper bound of the 
element (1,2) is equal to 


k—1 k—2 

Ei,/2-l ^ p[sin(c,^(i - / - i,,)). (B,14) 

1=1 f=l 

It should be possible to adapt some results on products of sines (e.g., Freiman & 
Halberstam, 1988, and Janous & King, 2000) to the general case. More precisely, 
let Qk = maxp >2 n£i si^(27rt/P), then \im.k=>.oo{QkY^^ = 0.6098579.... But it will 
be tricky given the specihc form (IB. 1411 . and we will restrain the proof by assuming 
a sufficient (but not necessary) condition that a = 2 tt/Pi and b = 27 T/P 2 , for some 
strictly positive integers Pi and P 2 - Then sin(a(t — /)) = 0, for t — / = giPi, for 
gi G Z, and sin(6(f —/ —1)) = 0, for f —/ —1 = 5 ' 2 -P 2 , for g 2 G Z. Let k Pi + P 2 + 1- 
Then, for any given t, take the remainders gi and g 2 of the Euclidean divisions of t by 
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Pi and of t — 1 by P 2 , respectively. These remainders are between 0 and, respectively, 
Pi — 1 and P 2 — 1, hence smaller than k — 2. Hence for any k > k, there is some integer 
/, either t —^fiPi, or t — l—g 2 P 2 between 0 and k — 2 such that either sin(a(t —/)) = 0 
or sm{b{t — / — !)) = 0 and hnally each term of fIB.ldj) vanishes. Hence, taking u > k, 
the second term of flB.131) vanishes for all t and the Frobenius norm can be bounded 
by some expression for some A^i. Note also that, under the conditions of 

integral periods, we have 


W'^’tikWp = \\^t2k\\F = 0, k>k. 


By using flB.lD - flB.2D - flB.3p the other inequalities of this assumption are checked. 


B. 1.3 Assumption H3 3 
Trivial. 


B. 1.4 Assumption H3 4 
Trivial. 


B. 1.5 Assumption H3 5 
Trivial. 


B. 1.6 Assumption H3 6 

The second term in H3,6 is equal to 0 , so it remains to show that 



for i,j = 1,2, where the matrix V = (l^,j)i<jj <2 is a strictly positive dehnite matrix. 
From 03.71) we have 


det{9) 

dOi 


t-i 

k=l 
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entailing, since we have taken S = / 2 , 


'J0O 




V 00 


001 


= Ego 


' t-1 


t-l 


'^'iptik{0,0^)(^t-k\ s ^y^'0tifc(6',6'°)6t_fc 


.fc=l 


k=l 


t-l 


sin^(at) tr 


fc=i 


(fc-i) .(fc-i) \'r / .(fc-i) .(fc-i) 


0 


0 


0 


0 


t-l 


= sin at 


k=l 


t)Y, h'tu" + 


(B.15) 


The terms for k > k vanish, as explained above. The remaining terms are bounded by 
also as above, and the sum is strictly positive, at least for some f’s. Therefore, 
taking the average for f = 1 to n gives a finite strictly positive limit Vn when n —>■ oo. 
By the same method for i,j = 2 


Ego ( ^^ti^) ^-idet{0) 


V 002 


000 


= Ego 


t-l 


t-l 


k=l 


J2^t2ki0,0yt-k ) J:-^J2^t2ki0,0yt 

k=l 

^sin^(&f) tr 

k=l 

sin^ (bt) (A% 


t-l 


0 0 

0 


0 0 

0 a^j:2 :2^ 


:,2 


k=l 


with a similar consequence to provide a hnite strictly positive limit V 22 when n ^ 00 . 
Furthermore, it can easily be seen 


Vio — V21 — 0 . 


Consequently, the matrix V = (14j)i<jj<2 is strictly positive dehnite. 

Remark B.l If we handle the model described in Section [31, but without the param¬ 
eter A'^ 2 ! ^be case where the number of observations is 25, we obtain the follow¬ 
ing standard errors from that theoretical formula for the two remaining parameters: 
0.2175 and 0.2303, respectively, which largely agree with the averages drawn from the 
simulation results: 0.2161 and 0.2226, as shown in TableUi 
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B. 1.7 Assumption H3 7 

For the first part of this assumption, since gt is bounded, it remains to show that 

^ n—1 n—d t—1 

d=l t=l k=l 

For i = 1 

t-i 

\\'4’tik\\F\\'4’t+d,i,k+d\\F = I sin(at) sin(a(t + d))| 

k=l 

t-1 

xE 

k=l 

(B.16) 


Ak-l) 


+ 


Ak-1) 


1/2 


A (k+d—l) 


+ 


4 (^k-\-d— 


-1) 

2 


1/2 



To simplify the proof we assume again that a = 2n/Pi and b = 271/P 2 , for some 
integers Pi and P 2 and use k as dehned above. Then flB.lbl) becomes a sum for k = 1 
to k — d. 

The general term can be bounded by + {k — x) 2 ^ji /2 

which is of order Hence 


n—1 n—d 

EEEJi = 


d=l t=l 


< 

< 

< 


n—1 n—t 

EEiEii 

t=l d=l 
n—1 n—t 

EE*. 

t=i d=i 


$ 1/2 


*■(*)(! _4i/.)2 

k2{^)n, 


n—1 


El 


where fci(‘h) and k 2 {^) are constants and where we have used the formula = 

x/{l — x)^, provided |a;| < X. Dividing by thus gives 0(X/n), as requested. 

Applying the same method for i = 2, with again a sum for k = 1 to k but of 
2 '^ instead of the product of square roots of sums of squares in (IB.Xbp . 
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with a general term bounded this time by i)/2<j)(fc+c* i)/ 2 ^ show that 


n—1 n—d t—1 


d=l t=l k=l 


F 


V’ 


(n) 

t-\-d^‘2,k-\-d 


F 


< 

< 


n—1 n—1 




h{^)n, 


with other constants fc 3 (‘h) and fc 4 (‘h) and reach the same hnal conclusion as for i = 1. 

For the second part of this assumption, since St(E), E 0 E and vec(E) vec(E)^ 
are hnite constants, similarly to the hrst part we can show that the second part of 
assumption H 3 7 is fulhlled. 


B.2 Example 14.2 

The assumptions H 3 . 1 —H 3.5 are easily checked: H 3 ,i is trivial, H 3.2 remains unchanged 
but requires the same conditions on integers Pi,P 2 as for the previous example, and 
H 33 -H 35 are readily checked given boundedness of the sine function. It remains to 
discuss about H 3,6 and H 3.7 which we are going to check now. Note that from (14.dh 
the vector of the parameters is 6^ = {A\^^A' 22 -i riii,rj 22 Y and 0° = ?7?i, ^ 722 )^ 

is its true value. Note that we assume again that <F is of the form flB.9D . Finally, we 
write sii, S 12 and S 22 the entries of the matrix E (without loss of generality, we could 
as well take E the identity matrix, as was done for the proof of the previous example). 


B. 2.1 Assumption H3 6 

We start by making an important observation, namely that the matrix V = ( 14 ,j)i<i,j <4 
is block-diagonal, with blocks for (i,j) E {1,2} and (i,j) E {3,4}, respectively. 

Regarding the second block, tedious calculations (therefore carried out with Math- 
ematica) for 6*3 = rjn yield 


tr 


d03 f 

- S 12 )' + 2(siiS22 - sf2) 




= 2 sin^(cf)- 


(1 -f (S11S22 - sfa) 


= V^33(t), 


where Vij(t) is the term t in the sum dehning Vij, i,j = 1, 2, 3,4. Since E is an invert¬ 
ible matrix, S 11 S 22 —= det(E) > 0, hence this expression is clearly positive. Bound¬ 
ing the numerator is straightforward (since | sin(cf)| < 1 ), and 1 / (l -|- e(^ii+'^ 22 )sin(ct )^2 
can simply be bounded by 1. Hence, there exist a constant a not depending on t such 
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that V 33 (t) < a, consequently the entry V 33 is positive and hnite. The same conclusion 
obviously holds for V 44 . Turning our attention towards V 34 , we obtain 


tr 



dOs 




864 



. 2/ X (si2 (e^22sin(ct)g^^ _ _ e(^ll+’?22)«in(rf)(si^S22 “ 

\ J_ 1 _ L 

(1 + e(’?ii+^ 22 )«i"hb )2 (S11S22 - s?2) 


^34(t). 


Again, it is a simple exercise to show that this term is bounded independently of t. 
Let us turn our attention towards the block (1, 2) now. From fl3.7p we know that 


deUd) 

86, 


t-i 


5]fc(9,9“)9: 


t—k^t—k • 


k=l 


Following Section [6.1.61 for i,j = l with di = we obtain a generalized version of 
(IB.ISP as 


Ego ( 


V 86 


861 


= Eqo 


't-i 


t-i 


^'ijjtik{6,6°)g-, 


t—k^t—k 




t—k^t—k 


,k=l 
t-1 

= sin^(at) tr 

k=l 


k=l 


4 (^- 1 ) 4 (^- 1 ) 

0 0 




t—k 


4 (^- 1 ) 4 (^- 1 ) 

0 0 


an expression clearly bounded (under the same conditions as those in B.l). Its posi¬ 
tiveness follows from the fact that we consider a Mahalanobis distance in the metric 

Finally, checking that the blocks (1,2) and (3,4) are positive dehnite has been 
done numerically in the numerical example of Section 5.2. 


B. 2.2 Assumption H 3 7 

In the hrst part of this assumption we have to show that 


n" 


n—1 n—d t—1 

d=l t=l k=l 


t—k 


Vtik 


r t-\-d,i,k-\-d 



From fl4.6ll we have 






























which is an easily bounded function. Consequently, we are left with exactly the same 
expression as for Example 4.1, which solves the question. 
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